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Abstract 

Given a set of leaf-labeled trees with identical leaf sets, the well-known Maximum 
Agreement SubTree problem (MAST) consists of finding a subtree homeomor- 
phically included in all input trees and with the largest number of leaves. Its variant 
called Maximum Compatible Tree (MCT) is less stringent, as it allows the input 
trees to be refined. Both problems are of particular interest in computational biology, 
where trees encountered have often small degrees. 

In this paper, we study the parameterized complexity of MAST and MCT with 
respect to the maximum degree, denoted by D, of the input trees. Although MAST 
is polynomial for bounded D [H [3], we show that the problem is W[l]-hard with 
respect to parameter D. Moreover, relying on recent advances in parameterized 
complexity we obtain a tight lower bound: while MAST can be solved in 0(N°^) 
time where N denotes the input length, we show that an 0(N°( D ^) bound is not 
achievable, unless SNP C SE. We also show that MCT is W[l]-hard with respect to 
D, and that MCT cannot be solved in 0(N° < - 2D/2 '>) time, unless SNP C SE. 

1 Introduction 

Throughout this paper, N denotes the set of non-negative integers and, for all n e N, the 
set {1, 2, ... , n} is denoted by [1, n). 

1.1 Agreement subtree and compatible tree 
1.1.1 Trees 

All trees considered in this paper are rooted evolutionary trees, i.e. trees representing the 
evolutionary history of a set of species. Such trees are unordered, bijectively leaf-labeled 

*The paper is a revised version of the conference paper [S]. 
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and their internal nodes have at least two children each. Labels are species under study 
and the branching pattern of the tree describes the way in which speciation events lead 
from ancestral species to more recent ones. 

Leaf labels. For convenience, we will identify the leaves with their labels when the tree 
is understood. Let T be a (rooted evolutionary) tree. The leaf label set of T is denoted by 
L(T). We say that T is a tree on L(T). The size of a tree is defined as the cardinality of 
its leaf set. 

Degree. The (out-)degree of a node in T is the number of its children. The maximum 
degree of T, denoted by A(T), is the largest degree over all nodes of T. 

Parenthetical notation. Parenthetical notation is a convenient way to represent evo- 
lutionary trees. Given d non-empty trees Ti, T2, . . . , Td with pairwise disjoint leaf sets, 
(Ti, T2, . . . , Td) denotes the tree whose root has degree d and admits as child subtrees T\, 
T2, • • • , T d . 

Restriction. For each subset X C L(T), the ( topological) restriction of T to X is denoted 
by T\X. Colloquially, T\X is the tree on X displaying the branching information of T 
relevant to X. Restriction is formally defined by induction as follows. 

(Basis). For each leaf-tree £, £\{£} = £ and £\® is the empty tree. 

(Inductive step). Assume that T is of size at least two: T = (T 1 ,T 2 , . . . ,T d ) with d > 2. 
If X is a subset of L(Tj) for some i G [1, d] then T\X — Ti\X, otherwise, T\X is the tree 
on X whose root admits as child subtrees all non-empty trees of the form Ti\(L(Ti) fl X) 
with i G [1, d]. 

1.1.2 MAST and MCT 

Let T be a collection of trees on a common leaf set. 

Agreement subtree. An agreement subtree of T is a tree T such that, VTj G T, T = 
Ti\L(T). The Maximum Agreement SubTree problem (MAST) consists of finding an 
agreement subtree of T of largest size. In phylogenetics, the maximum size of an agreement 
subtree of T is a useful measure of the similarity of the trees in T [7]. From the point of 
view of the MAST problem, a node v of degree d in an input evolutionary tree represents 
the simultaneous creation of d descendant from the ancestral species represented by v. 
As such events are rare if d is greater than two, the trees that people want to calculate 
maximum agreement subtree for have usually small maximum degrees. 
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Compatible tree. Let T and T' be two trees on a common leaf set. We say that T refines 
T' if T' can be obtained by collapsing a selection of edges of T. A tree compatible with T is a 
tree T such that, VTj e T, T refines TjfL(T). Obviously, agreement implies compatibility. 
The converse is usually false for collections including at least a non-binary tree. The 
Maximum Compatible Tree problem (MCT) consists of finding a tree of largest size 
compatible with T. The MCT problem is more relevant than the MAST problem when 
comparing reconstructed evolutionary trees [TOl [8]. From the point of view of MCT, a 
non-binary node is usually interpreted as a lack of decision with respect to the relative 
grouping of its children rather than as a multi-speciation event. As data sequences are 
getting longer and phylogenetic methods more accurate, the maximum degree of indecision 
in reconstructed trees is expected to decrease to a small constant. 

1.1.3 Previous results 

MAST is polynomial on two trees (see [13] for the latest algorithm) but becomes NP-hard 
on three input trees [1] . MCT is NP-hard on two trees even if one of them is of maximum 
degree three [TT] (see also [TO]). 

Consider now the general setting of an arbitrary number, denoted by k, of input trees. 
Let T = {Ti,T%, . . .Tfc} be the input collection. Let n be the cardinality of the common 
leaf set of the Tj's, let d := minf =1 A(Tj) and let D : = maxf =1 A(Tj). Above, we argued 
about the relevance of solving MAST and MCT on bounded maximum degree trees. Three 
different algorithms were proposed to solve MAST in polynomial time for bounded d [JJ El 
[3]. The fastest of these algorithms [HI [3] run in 0(n d + kn 3 ) time. 

Besides, MCT can be solved in 0(A kD n k ) time [8]. Hence, for bounded k, MCT is FPT 
in D. The same result holds for MAST. Assume that a bound p on the number of leaves to 
be removed from the input set of leaves so that the input trees agree, resp. are compatible, 
is added to the input. Then MAST, resp. MCT, can be solved in 0( min{3 p kn, a p + kn 3 }) 
time, where a is a constant less than three [2j . Thus, both problems are FPT with respect 
to p. 

1.1.4 Our contribution 

We prove that both MAST and MCT are W[l]-hard with respect to D. Furthermore, let 
ip : N — > N be an arbitrary recursive function. Note that the input T is of size O(kn). We 
prove the following. 

(Rl). MAST cannot be solved in ip(D)(kn) o{D) time, unless SNP C SE. 

(R2). MCT cannot be solved in <p(D)(kn) o(2D/2) time, unless SNP C SE. 

Recall that SE [12] is the class of problems solvable in subexponential time and that 
SNP [14] contains many NP-hard problems. Hence, the inclusion SNP C SE is unlikely. 
According to result (Rl), the 0(n d + kn 3 ) time algorithms for MAST [6j [3] are somehow 
optimum. Results (Rl) and (R2) are proved in Sections [2] and [3], respectively. 
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1.2 Parameterized complexity 

In order to clearly prove our intractability results, we recall the main concepts of param- 
eterized complexity [5], together with some recent results. We also introduce the notions 
of linear FPT-reduction and weak fixed-parameter tractability. 

Let S be a finite alphabet. The set of all finite words over £ is denoted by £*, and 
for each word x G £*, \x\ denotes the length of x. A parameterized (decision) problem is a 
subset PCNxS*. Each element of (k, x) G N x £* is an instance of P, k standing for the 
parameter. A yes-instance of P is an element of P and a no-instance of P is an element 
of (N x £*) - P. 

1.2.1 Fixed-parameter tractability and weak fixed-parameter tractability 

The parameterized problem P is called fixed-parameter tractable (FPT), if there exist an 
algorithm A and a recursive function ip : N — > N such that, for each (k,x) G N x £*, 
A decides whether (k,x) is a yes-instance of P in ip(k) time. The parameterized 
problem P is called weakly fixed-parameter tractable (WFPT) if there exist an algorithm A 
and a recursive function ip : N — > N such that, for each (k, x) G N x £*, A decides whether 
(k,x) is a yes-instance of P in time. 

1.2.2 FPT-reduction and linear FPT-reduction 

Let P, Q C N x S* be two parameterized problems and let / : N x S* — > N x £*. 

We say that / is a (many-to-one, strongly uniform) FPT-reduction from P to Q if 
there exist recursive functions g : N x S* — > £* and 7 : N — > N satisfying, for all 
G N x £*: 

1. f(k,x) is computable in ip(k)\x\°^ time, 

2. f(k,x) G Q if and only if (k,x) G P, and 

3. f(k,x) = (j(k),g(k,x)). 

Moreover, if 7 is at most linearly increasing (i.e. if ^(k) = O(k) as k —>■ 00) then we say 
that / is a linear FPT-reduction from P to Q. 

FPT-reductions compose, and preserve fixed-parameter tractability. Linear FPT- 
reductions compose, and preserve weak fixed-parameter tractability. Note that our notion 
of linear FPT-reduction is slightly different from the one introduced by Chen, Huang, Kanj 
and Xia [1] . 

1.2.3 Independent set 

Formally, an (undirected) graph is an ordered pair G = (V, E) where V is a finite set of 
vertices and where E a set of 2-element subsets of V. The elements of E are the edges of 
G. The elements of an edge are called its endpoints. An independent set of G is a subset 
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/ C V such that, for each edge e & E, at least one of its endpoint is not in I. The problem 
of finding an independent set of maximum cardinality in a given input graph plays a central 
role in computational complexity theory. 

Name: Independent Set (IS). 

Instance: A positive integer k and a graph G = (V, E). 
Question: Is there an independent set of G with cardinality kl 

The version of IS parameterized by k is denoted by IS [A;]. This problem is not believed to 
be FPT as it is complete under FPT-reductions for the class W[l] [5]. Moreover, IS [A;] is 
not WFPT either, unless SNP C SE | Theorem 5.5]. 

2 Parameterized complexity of MAST 

The decision version of the MAST problem is: 
Name: Agreement SubTree (AST). 

Instance: An integer q > 1 and a finite collection T of trees on a common leaf set. 
Question: Is there an agreement subtree of T with size ql 

We denote by AST[£)] the version of AST parameterized by D := max^ e -2- A(T). In this 
section, we prove: that AST[D] is W[l]-hard, and Result (i?l) stated in Section 11.1.41 
According to Section ITT21 it is sufficient to present a linear FPT-reduction from IS [A;] to 
AST[D] . 

For each integer p > 1, we introduce the following auxiliary problem: 

Name: Partitioned Independent Set with multiplicity p (PIS P ). 

Instance: An integer k > 1, a graph G = (V, E), and k independent sets Vi, V 2) ■ ■ ■ , 

14 of G of equal cardinality partitioning V. 

Question: Is there an independent set I of G such that I D Vi has cardinality p for all 

ie[i,Jfe]? 

For each instance (k, G, V\, V2, ■ ■ ■ , Vk) of PIS P , the graph G is £;-colorable: the V^'s yield a 
fc-coloring of G. The version of PIS P parameterized by k is denoted by PIS P [A;]. We reduce 

15 [A;] to AST[D] going through PISi[A;]. In the next section, the decision version of MCT 
is reduced to IS going through PIS 2 . 

Lemma 1. IS [A;] linearly FPT-reduces to PISifA;]. 

Proof. Reduce IS [A;] to PISi[A;] in the same way as Pietrzak reduces Clique to Par- 
titioned Clique [15J. Each instance (k,G) of IS is transformed into an instance 
(A;, G, Vi, V2, . . . , Vk) of PISi where G and the V^'s are as follows. 
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Let V denote the vertex set of G. G is the graph on V x [1, A;] whose edge set is given 
by: for all (u, i), (v,j) G V x [1, k], {(u, i), (v, j)} is an edge of G if and only if i ^ j and 
either {u,v} is an edge of G or u = v. For each i G [1, k], Vi is defined as Vi := x {z}. 
It is clear that (k, G, V~i, V 2 , . . . , Vk) is an instance of PISi [A;] computable from (k,G) in 
polynomial time. It remains to check that (A;, G) is a yes-instance of IS if and only if 
(k, G, Vi, V 2 , • • • , Vfe) is a yes-instance of PISi. 

(i/). Assume there exists an independent set I of G such that I fl is a singleton for 
all z G [l,fe]. For each z G [1,A;], let fj G V, be such that J D Vi = {(t>j,z)}. The set 
/ := {vi,V2, • • • , ffc} is an independent set of G with cardinality fc. 

(only if). Conversely, assume that there exists an independent set I of G with cardinality 
k. Write I in the form / = {vi,v 2 , ■ ■ ■ , v^}. The set I := {(f 1, 1), (v 2 , 2), . . . , (t> fc , fc)} is an 
independent set of G and I HVi — {(v^, i)} is a singleton for all z G [1, fc]. 

□ 

In order to clearly prove Theorem [U we first introduce some useful vocabulary. 

Definition 1. Let T and T' be two trees and let L be a subset of L(T) H L(T"). We say 
that T and T' disagree on L ifT \L and T' \L are distinct. 

Assume that L(T) C L(T'). If there exists a subset L C L(T) such that T and T' 
disagree on L then T is not a restriction of T' . Conversely, if T is not a restriction of T' 
then T and T' disagree on some 3-element subset of L(T) [3]. This explains the central role 
played by 3-leaf sets of disagreement in the proofs of Lemmas |2] and [3] below. Note that 
given three distinct leaf labels a, b and c, there are exactly four distinct trees on {a, b, c}: 
the non-binary tree (a, b, c), and the three binary trees ((b, c) ,a), ((a, c) , b) and ((a, c) ,b). 

Theorem 1. lS[k] linearly FPT-reduces to AST[D]. 

Proof. According to Lemma HJ it suffices to linearly FPT-reduce PISi[fc] to AST[D]. Each 
instance (k, G, Vi, V 2 , . . . , Vk) of PISi is transformed into an instance (q, T) of AST where 
q := k and where T is a collection of trees described below. Without loss of generality, we 
can assume that all V^'s (i G [1,&]) have cardinality at least three and that k is at least 
three. 

The collection T . We construct a collection T of gadget trees whose leaf set is the 
vertex set V := Vi U V 2 U • • • U V k of G. 

For each i G [1, k], compute an arbitrary binary tree Bi on Vi. The tree on V whose 
root admits B\, B 2 , . . . , B k as child subtrees is denoted by C: C = (Bi, B 2 , . . . , Bk). Every 
tree of T is obtained by modifying the positions of exactly two leaves of C. 

For all a, b G V with a ^ b, C a ^ denotes the tree on V obtained from C, by first 
removing its leaves a and b, and then re-grafting both of them as new children of the root. 
Formally, C a ^ is the tree 

(fix t(Vi - {a, b}), B 2 \(V 2 - {a, &}), ... , B k \{V k - {a, b}), a, b) . 
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Figure 1: Some of the gadget trees encoding an instance (k, G, V\, V 2 , . . . , V k ) of PISi[fc] 
where k — 3, V\ — {a, b, c, d}, V 2 = {e, f , g, h}, V3 = {i, j, k, 1} and {c, f } is an edge of G. 



We set C := {C} U {C afi : a,b G V,a ^ b}. 

Remark 1. There exist at most two indices i such that Bi\(Vi — {a,b}) is distinct from 
Bi, and since Vi has cardinality at least three, Bt\(Vi — {a, b}) is a non-empty tree for all 
i. 

Let E denote the edge set of G: G = (V, E). For each edge e = {a, b} G E, S e denotes 
the tree on V obtained from C, by first removing its leaves a and b, and then re-grafting 
(a, b) as a new child of the root. Formally, S e is the tree 



The collection of trees T is defined as T := CU{S e : e G E} (see Figure!!]): C is the control 
component of our gadget and the S e 's (e G E) are its selection components. 

Lemma 2 (Control). Let T be a tree with L(T) C V. Statements (i) and (ii) below are 
equivalent. 

(i). T is an agreement subtree of C with size k. 

(ii). T = (ci, c 2 , • • • , Cfc) for some (c h c 2 , . . . , c k ) G Vi x V 2 X • • • X V k . 

Proof. Let (ci, c 2 , ■ ■ ■ , c&) G V\ X V 2 X • • • x V k . Distinct q's appear in distinct child subtrees 
of the root of C, resp. of C a f,- Hence, (ci,c 2 , ■ ■ ■ , c&) is a restriction of C, resp. of C a ,b- 
This proves that (ii) implies (i). It remains to show that (i) implies (ii). 
Assume (i): T is an agreement subtree of C with size k. 



(B, \(Vt -e),B 2 \(V 2 -e),..., B k \(V k - e), (a, b)) . 
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• We first prove that T has height one. By way of contradiction, suppose that the height 
of T is greater than one. Then, one can find three distinct leaves a, b, c G L(T) such that 
T\{a,b,c} = ((a,b) , c). (Indeed, there exists an internal non-root node v of T. Pick a 
leaf c which is not a descendant of v and two descendant leaves a and b of v.) However, 
C a ,b \{o>, b, c} = (a, b, c), and thus T and C a ,b disagree on {a, b, c}: contradiction. 

Since T has height one, there exist k pairwise distinct leaf labels c\, c 2) . . . , c& G V 
such that T = (ci, C2, . . . , c&). 

• We now show that distinct Cj's belong to distinct Vj's. By way of contradiction, assume 
there exist three indices z, ji, j 2 G [l,fc] satisfying jx 7^ j 2 , Cji G and Cj 2 G V*. Since 
k is greater that two, there exists j G [l,fc] such that j {ji,j2}- If c? G V; then 
C\{cj 1 ,c j2 ,c j } = Bi\{c h ,c j2 ,Cj} and if cj £ Vi then Cfjc^, c h , c,} = ((c^, c h ) , cj). In 
both cases, Cl^^.c^.Cj} is a binary tree unlike T\{cj 1 ,Cj 2 ,Cj}. Thus, C and T disagree 
on {cj 1 ,Cj 2 ,Cj}: contradiction. 

Up to a permutation of the Cj's, one has (ci, C2, . . . , c&) G Vj. x V2 x ■ ■ • x Vjfc. This proves 
(m) and concludes the proof of Lemma EJ □ 

Lemma 3 (Selection). Let e G E be an edge ofG and let (c.\, C2, ■ ■ ■ , cj.) G V\ XV2 x • • • xVj.. 
The tree (ci, C2, . . . , c^) «s a restriction of S e if and only if at least one endpoint of e is not 
in {ci,c 2 , ...,c k }. 

Proof. The "if part" is easy. Let us now show the "only if" part. 

Assume that (ci, C2, . . . , Ck) is a restriction of S e and that e C {ci, C2, . . . , Cfc}. Let 
and Cj 2 be the two endpoints of e: e = {c^, q 2 }. Since fc is greater than two, there exists 
i G [1, fc] such that Cj ^ e. The restriction of to {0^,0^, q} equals ((c^, q 2 ) , q), and 
thus ^ disagrees with (ci, c 2 , . . . , c^) on {c ix , c i2 , Cj}: contradiction. This concludes the 
proof of Lemma [3j □ 

Correctness of the reduction. It is clear that (q, T) is computable in polynomial time 
from (k,G,Vi,V 2 , . . . , Moreover, the root of C has degree k, the root of C a ^ has degree 
k + 2, the root of has degree fc + 1, and any non-root internal node of a tree in T has 
degree two. Hence, the maximum degree D over all trees in T is equal to A; + 2: D — O(k). 
Eventually, let us derive from Lemmas [2] and [3] that (k, G, V\_, V2, ■ ■ ■ , V&) is a yes-instance 
of PISi if and only if (q, T) is a yes-instance of AST. 

(if). Assume there exists an agreement subtree T of T with size q = k. The tree T is of 
the form T = (c\, C2, . . . , <%) for some (c\, C2, . . . , cj-) G Vi x V2 x • - • x V& by Lemma [2j 
Furthermore, the set 7 := {ci, c 2 , . . . , c^} is an independent set of G by Lemma [31 and for 
every i G [1, k], I H Vj = {cj} is a singleton. 

(on/y i/). Conversely, assume that there exists an independent set I of G such that JflT^ is 
a singleton for all i G [1, k}. Write I in the form / = {ci, C2, . . . , c^} with (c\, C2, . . . , cjj.) G 
Vi x V2 x • • ■ x Vfc. The tree (ci, C2, . . . , Ck) is both an agreement subtree of C by Lemma[2] 
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and an agreement subtree of {S e : e G E} by Lemma [3l Therefore, (ci, C2, . . . , c*.) is an 
agreement subtree of T with size g. 

□ 

3 Parameterized complexity of MCT 

The decision version of the MCT problem is: 
Name: Compatible Tree (CT). 

Instance: An integer q > 1 and a finite collection T of trees on a common leaf set. 
Question: Is there a tree of size q compatible with T? 

Let CT [2 L-°/ 2 -l ] denote the version of CT parameterized by 2' £) / 2 l, where D : = 
maxxgr A(T). In this section, we linearly FPT-reduce IS [A;] to CT[2 L- /^ ] i n order to 
prove: the W[l]-hardness of the version of CT parameterized by D, and Result (R2) 
stated in Section 11.1.41 PIS2 is used as an auxiliary problem. 

Lemma 4. lS[k] linearly FPT-reduces to PIS2[fc]. 

Proof. According to Lemma dj it suffices to linearly FPT-reduce PISi [A;] to PIS2[A;]. We 
rely on a padding argument. Each instance (k, G, Vi, V2, ■ ■ ■ , 14) of PISi is transformed 
into an instance (k, G, Vi, V2, ■ ■ ■ , Vk) of PIS2 where G and the Vi's are as follows. 

Informally, G is obtained by adding k isolated vertices to G, and each Vi is obtained by 
adding a single one of these new vertices to Vj. More formally, let V denote the vertex set 
of G and let E denote the edge set of G: V = V% U V2 U • • • U Vk and G = (V, E). Let a%, a 2 , 
. . . , ak be k new vertices: for all i, j G [1, A;], Oj is not an element of V, and i 7^ j implies 
ai 7^ a,j. Construct G := (V U {ai, a 2 , . . . , ak}, E), and Vi := Vi U {g^} for each i G [1, A;]. 

It is clear that (A;, G, V%, V%, . . . , Vk) is an instance of PIS2 computable in polynomial 
time from (k, G, Vi, V2, . . . , Vk). It remains to check that (k, G, Vi, V2, . . . , 14) is a yes- 
instance of PISi if and only if (A;, G, Vi, V2, • • • , 14) is a yes-instance of PIS2. 

(only if). Assume that there exists an independent set J of G such that JnVi is a singleton 
for every i G [1, k]. Then I := I U {ai, a 2 , . . . , a^} is an independent set of G, and / fl Vi is 
a doubleton for all i G [1 , A;] . 

(if). Conversely, assume that there exists an independent set I of G such that I n V* is a 
doubleton for every i G [1, A;]. For each z G [1, A;], pick an element Vi in /D Vi distinct from 
Oj. The set 7 := {i> 1, t> 2 , . . . , f^} is an independent set of G, and In Vi — {vi} is a singleton 
for all 16 [1 , k] . 

□ 
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Remark 2. The mapping (k, G, Vi, V 2 , ■ ■ ■ , Vk) 1 — > (k, G, V\, V 2 , ■ ■ ■ , Vk), presented in the 
proof of Lemma [7} induces a linear FPT-reduction from PIS p [k] to PIS p +i[A;] for any 
integer p > 1,. Since lS[k] linearly FPT-reduces to PISi[/c] by LemmaUl IS [A;] linearly 
FPT-reduces to PIS p [/c] /or ever?/ integer p > 1. 

Definitions El [3] and H] introduce gadgets that are used to reduce PIS 2 to CT in the the 
proof of Theorem [2l 

Definition 2. Lei n be a positive integer, let T be a tree on [1, n], and let T\, T 2 , . . . , T n be 
n non-empty trees with pairwise disjoint leaf sets. The tree on L(T\) U L(T 2 ) U • • ■ U L(T n ), 
obtained by replacing each leaf i ofT with Tj is denoted by T[Ti,T 2 , . . . ,T n ]. 

For instance, let T := ((1, 2) , (3, (4, 5)) , 6). For any non-empty trees Ti, T 2 , T 3 , T 4 , T 5 , 
T 6 with pairwise disjoint leaf sets, we have 

T[T 1; T 2 , T 3 , T 4 , T 5 , T 6 ] = ((T U T 2 ) , (T 3 , (T 4 , T 5 » , T 6 ) , 

and in particular, 

T[(2, 3) , 1, (6, 7, 8) , 4, 5, ((9, 11) , 10)] = ((1, (2, 3)) , ((4, 5) , (6, 7, 8)) , ((9, 11) , 10)) . 

Definition 3. For each integer n > 1, R n denotes the binary tree on [l,n], defined recur- 
sively as follows: 

• Ri = 1, and 

• R n = (R n -i, n) for every integer n > 2. 

For instance, one has R 2 = (1,2), R 3 = ((1,2), 3), R 4 = (((1, 2) , 3) , 4), R 5 = 
((((1,2), 3), 4), 5), etc 

Property 1. Let n be a positive integer. Let v 1 , v 2 , . . . , v n be n pair- 
wise distinct labels. A tree with leaf labels in {v 1 , v 2 , . . . , v n } is compatible with 
{R n [v 1 , v 2 , . . . , v n ], R n [v n , . . . , v 2 , v 1 }} if and only if its size is at most two. 

Definition 4. For every integer k > 1, Hk denotes a binary tree on [l,k] with minimum 
height [log A;]; for all i, j 6 [1, A;], H% 3 denotes the tree on [l,k] obtained from H k by 
collapsing all internal edges on the path connecting i and j; denotes the least common 
ancestor of i and j in H^ J . 

For instance, (((1, 2) , (3, 4)) , 5) is a suitable tree H 5 , and 
(((1, 2) , (3, 4)) , ((5, 6) , (7, 8))) is a suitable tree H 8 ; for such trees, one has 
Hl A = ((1,2, 3, 4), 5) and H 3 / = ((1,2), (3,4,5,6), (7,8)). 

Property 2. All internal nodes in H^ J are of degree two, except maybe Xjf whose degree 
is at most 2 [log Af| . 

Theorem 2. IS[Ar] linearly FPT-reduces to CT[2L D / 2 J]. 

Proof. According to LemmaUl it suffices to linearly FPT-reduce PIS 2 [A;] to CT[2L D / 2 J]. 
Each instance (k, G, Vi, V 2 , . . . , Vk) of PIS 2 [A;] is transformed into an instance (q, T) of CT 
where q := 2A; and where T is a collection of trees described below. 
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Figure 2: The trees C and C in the case of k = 5 and n — 4. 

The collection T. We construct a collection T of gadget trees on the vertex set V :— 
V\ U V% U • • • U V k of Cr. Let n be such that has cardinality n for every i G [1, A;]. For 
each 2 G write Vi in the form = {vj, vf, . . . , vf}; := R n [vj , vf , . . . , v™] and 

Bt := R n [v?, ■ ■ ■ , w 4 2 , v\] encode ^. 

Let C := H k [B u B 2 , . . . , B k ] and let C := H k [B x , B 2 , . . . , B k ) (see Figure [2]): C and C 
are the control components of our gadget. 

Let E be the edge set of G: G = (V, E). For each edge e = {t>[, u|} G £7, compute the 
tree S e obtained from [Bi, B 2 , . . . , B k ] by first removing its leaves v\ and and then 
re-grafting (yl,Vj) as a new child subtree of \ % k (see Figure EJ). The S e 's (e G E) are the 
selection components of our gadget. 

The collection of trees T is defined as T := {C, C} U {S e : eG i?}. 

Property [3] below is easily deduced from Property HJ 

Property 3 (Control). Let T be a tree with L(T) C V. Statements (i) and (ii) below are 
equivalent. 

(z). T is a tree of size q, compatible with {C, C}. 
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1 3 4 1 9341 9341 94931 9 3 4 

V{ V{ V\ t>2 t>2 t>2 ^2 ^3 ^3 ^3 ^3 V 4 V t V 4 V l V t v 5 v 5 v 5 V 5 

Figure 3: The tree S r 2 3\ in the case of A; = 5 and n = 4. 

\"l> t '4J 

(u). T o/ the form T = Hj-[(ai, &i) , (a2, 62) > • • • j (flfc, ^fc)] where, for each i G [1, <2j 
and hi are two distinct elements ofVi. 

Property 4 (Selection). Let e G E be an edge of G and let T be a tree of size q compatible 
with {C, C}. Then, T refines S e \L(T) if and only if at least one endpoint of e is not in 
L(T). 

Correctness of the reduction. It is clear that (q, T) is computable in polynomial time 
from (k,G,Vi,V2, ■ ■ ■ ,Vk). Moreover, both C and C are binary trees, and all internal 
nodes in S e have degree two, except maybe whose degree is at most 2 [log Af| + 1 (see 
Property [2]). Hence, the maximum degree D over all trees in T is at most 2 [log A;] + 1, 
and thus 2 L^/2J = o(k). Eventually, it remains to show that: (k, G, Vi, V2, ■ ■ ■ , V&) is a 
yes-instance of PIS2 if and only if (q, T) is a yes-instance of AST. 

(if). Assume that there exists a tree T of size q compatible with T. Let / := L(T). By 
Property [3]/ fl Vi is a doubleton for every % G [1, k]. By Property HJ I is an independent 
set of G. 

(only if). Conversely, assume that there exists an independent set I of G such that iDVi is 
a doubleton for all i G [1, k]. For each i G [1, k], let a« and hi be such that I fl Vi = {aj, hi}. 
The tree T := iffc[(ai, 61) , (a 2 , b 2 ) , ■ ■ ■ , (a k , is compatible with {C, C} according to 
Property [3l Furthermore, T is also compatible with {S e : e G £"} according to Property HI 
We have thus exhibited a tree T of size q compatible with T. 

Remark 3. 2L D / 2 J = O(fc) enough to obtain Result (R2). But, our construction does 
not ensure that 2' Z3 / 2 I is a function of k only. Hence, our reduction is not exactly an 
FPT-reduction yet. Anyway, this can be easily repaired. Collapse 2 [log k~\ — 1 consecutive 
internal edges in B\ to obtain a tree B[ of maximum degree 2 [log k~\ + 1 and add to T the 
tree C' := H k [B[, B 2 , . . . , B k ]. 

□ 
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